Grid Dynamization

Author

  • Z. Vlčková
Abstract

Currently, emphasis is placed on the update rate of indices that work with huge data sets, e.g. all Internet pages. This article addresses inverted index generation, the dynamization of static data structures, and finally the distribution of a dynamic index across multiple nodes. At each step of upgrading such data structures we describe what changes have to be made and the benefits (and possible disadvantages) of the recently developed data structures. The goal of this paper is to show experimentally that updating a dynamic inverted index distributed across multiple nodes is quicker than updating a static one, and that we can save up to tens of percent of read and write operations.

Introduction

Search engines are systems that try to answer questions such as “Which documents contain words...” from stored data. We focus on search engines for Internet pages, which are increasingly significant, so it is important to optimize their search speed and the amount of resources they consume. When a query is entered, the search engine returns documents sorted by a specific criterion (generally a rating function is used, e.g. Google’s PageRank [3; 9]). To speed up the retrieval of relevant documents when evaluating a query, search engines use an index [4]. Without an index the search engine would have to go through all the documents, which would waste a lot of time and computational power. According to [12] there are more than 2.5 billion pages available on the Internet. A search engine without an index is unthinkable; going through all the documents would take hours. The price we pay for search speed is the necessity of periodic index updates, an operation which can prevent effective work with the index for a long time. Step by step we will build the indexing algorithm of a search engine and optimize it. In the chapter Inverted index we will describe the procedure of creating an inverted index.
In the following chapter, Dynamization, we dynamize a static data structure and describe the characteristics of the dynamic structure. The chapter Index maintenance details the data structures used and their properties as they are applied in Egothor (see [7]). In the chapter Index distribution we describe another way to improve index update properties, namely distributing the data structures across different nodes. In the chapter Simulation we describe the simulation program and the quantities studied. The chapter Experiments describes the outputs of the simulation program. The last chapter summarizes the conclusions drawn from the simulation outputs.

Inverted index

The construction of an inverted index can be divided into two steps [6]: first we build a forward index containing all significant words of the documents, then we convert it to an inverted index. The resulting inverted index maps each of these words to its positions in every document of the input set of data files. This data structure provides full-text search and is therefore used in data processing systems such as search engines. In practice, the inverted index is built directly (without the forward index). According to the vector model we define the term weight, which indicates the term's significance. All the

WDS'07 Proceedings of Contributed Papers, Part I, 116–124, 2007. ISBN 978-80-7378-023-4 © MATFYZPRESS
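The two-step construction described in the Inverted index section (a forward index first, then its conversion to an inverted index) can be illustrated with a minimal Python sketch. It is not the paper's or Egothor's implementation: the function names, the whitespace tokenization, and the stop-word set used to decide which words are "significant" are all assumptions made for illustration.

```python
from collections import defaultdict


def build_forward_index(documents, stop_words=frozenset()):
    """Step 1: map each document id to its significant words with positions.

    Assumption: words are split on whitespace and lowercased, and a word is
    "significant" when it is not in the (hypothetical) stop_words set.
    """
    forward = {}
    for doc_id, text in documents.items():
        words = text.lower().split()
        forward[doc_id] = [
            (pos, word) for pos, word in enumerate(words) if word not in stop_words
        ]
    return forward


def invert(forward):
    """Step 2: convert the forward index to word -> {doc_id: [positions]}."""
    inverted = defaultdict(dict)
    for doc_id, entries in forward.items():
        for pos, word in entries:
            inverted[word].setdefault(doc_id, []).append(pos)
    return dict(inverted)


def search(inverted, word):
    """Return the sorted ids of documents that contain the given word."""
    return sorted(inverted.get(word.lower(), {}))


if __name__ == "__main__":
    docs = {
        1: "the quick brown fox",
        2: "the lazy dog and the quick cat",
    }
    index = invert(build_forward_index(docs, stop_words={"the", "and"}))
    print(index["quick"])        # positions of "quick" per document
    print(search(index, "dog"))  # ids of documents containing "dog"
```

Building the index directly, as the text notes is done in practice, would amount to merging the two loops so that each word is written into the word-to-positions mapping as soon as it is read, without materializing the forward index.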


Publication date: 2007